Spread Lips + Raised Larynx + Higher F0 = Smiled Speech? – An Articulatory Synthesis Approach

Authors

  • Eva Lasarcyk
  • Jürgen Trouvain
Abstract

We present an initial study on how to model smiled speech with an articulatory speech synthesizer, guided by the question of which cues make smiled speech audibly distinct from non-smiled speech. In a perception test, we explore the relative contributions of i) spreading of the lips, ii) raising of the larynx, and iii) raising of the fundamental frequency (F0). Thirty-six test subjects assessed isolated synthetic vowel stimuli of /a:, i:, y:, u:/ on a 5-point “smiley scale”. Results indicate that F0 is the main acoustic factor for perceiving smileyness. The other factors depend on the vowel quality, with best results for the unrounded vowels /i:/ and /a:/.


Related resources

Vocal pitch discrimination in the motor system.

Speech production can be broadly separated into two distinct components: Phonation and Articulation. These two aspects require the efficient control of several phono-articulatory effectors. Speech is indeed generated by the vibration of the vocal folds in the larynx (F0) followed by "filtering" by articulators, to select certain resonant frequencies out of that wave (F1, F2, F3, etc.). Recentl...


Articulatory synthesis from x-rays and inversion for an adaptive speech robot

This paper describes a speech robotic approach to articulatory synthesis. An anthropomorphic speech robot has been built, based on a real reference subject’s data. This speech robot, called the Articulotron, has a set of relevant degrees of freedom for speech articulators, jaw, tongue, lips, and larynx. The associated articulatory model has been elaborated from cineradiographic midsagittal prof...


Acoustics vs. articulation in articulatory speech synthesis: One vocal tract target configuration has more than one sound

for ESSV 2010 (by Eva Lasarcyk): Acoustics vs. articulation in articulatory speech synthesis: One vocal tract target configuration has more than one sound. The goal of this contribution is to illustrate the importance of the acoustic settings of articulatory speech synthesis when using it for perception/validation experiments regarding the relationship between articulation and fine phonetic det...


The Sound of Deception - What Makes a Speaker Credible?

The detection of deception in human speech is a difficult task but can be performed above chance level by human listeners even when only audio data is provided. Still, it is highly contested, which speech features could be used to help identify lies. In this study, we examined a set of phonetic and paralinguistic cues and their influence on the credibility of speech using an analysis-by-synthes...


Acoustic to articulatory inversion

The context of this work is speech analysis. The subject deals with acoustic-to-articulatory inversion, i.e. the recovery of the temporal evolution of the vocal tract shape from the signal. This topic is important because it is likely to give rise to applications in the domains of speech coding as well as second language learning. Acoustic-to-articulatory inversion relies on an analysis by synt...




Publication date: 2008